A Symmetry Breaking Model for X Chromosome Inactivation 
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In mammals, dosage compensation of X linked genes in female cells is achieved by inactivation 
of one of their two X chromosomes which is randomly chosen. The earliest steps in X-inactivation 
(XCI), namely the mechanism whereby cells count their X chromosomes and choose between two 
equivalent X, remain mysterious. Starting from the recent discovery of X chromosome colocalization 
at the onset of X-inactivation, we propose a Statistical Mechanics model of XCI, which is investi- 
gated by computer simulations and checked against experimental data. Our model describes how 
a 'blocking factor' complex is self-assembled and why only one is formed out of many diffusible 
molecules, resulting in a spontaneous symmetry breaking (SB) in the binding to two identical chro- 
mosomes. These results are used to derive a scenario of biological implications describing all current 
experimental evidences, e.g., the importance of colocalization. 



X chromosome inactivation (XCI) is the phenomenon 
in female mammal early embryo cells by which one of 
their two X chromosomes, randomly chosen, is tran- 
scriptionally silenced, and epigenetically inherited in de- 
scendants, to equalize the dosage of X genes products 
with respect to males [l|, 0, El- Crucial aspects of this 
chromosome- wide stochastic regulatory mechanism, nec- 
essary to survival, still elude comprehension despite be- 
ing the focus of substantial interest for their important 
scientific and medical implication (see [l], 0, 0] and Ref.s 
therein). Starting from the important discovery of X 
colocalization during XCI establishment d S], in this 
paper we propose a Statistical Mechanics model of the 
early steps of XCI. 

Actually, XCI is a multistep process involving 0, 0, [3] : 
"counting" the number of the X chromosomes of the 
cell, "choice" of the inactive X, its silencing and main- 
tenance. Silencing and maintenance start being under- 
stood: the former is induced by the action of the Xist 
gene transcript, and maintenance of the inactive state 
is a paradigm of epigenetic inheritance [3j, [6j . Counting 
and choice are, instead, in many respects still mysteri- 
ous, though it is well established they are controlled by 
yet unknown sites located within a 1 Mb region on the 
X, the X-chromosome-inactivation center (Xic) , contain- 
ing several genes and regulators [J, 0] , such as the Xist 
gene. We also know that cells having a normal number 
of autosomes (non sex chromosomes) and extra copies of 
the X chromosome have only one active X, irrespective 
of the number of X's [3, H 0] . 

This biological scenario suggests 0, 0, Q that "con- 
trolling factors" for counting and choice derive from au- 
tosomes and interact with cis-acting regulatory sequences 
on the X chromosomes, whose position within the Xic is 
still unknown. Current models postulate the existence 
of a "blocking factor" (BF) ^MM, 

a complex made of 
X and autosomal factors, binding to the Xic of just one 
chromosome per diploid cell preventing its inactivation, 
as the second unprotected Xic in a female cell is inacti- 



vated bydefault. Multiple factors models were proposed 
as well [8|, |9(. These models do not account, though, for 
the discovery that colocalization, i.e., a physical proxim- 
ity, occurs between X chromosomes at the onset of XCI, 
specifically in the Xic region, a phenomenon shown to be 
necessary for XCI dHI. To comprehend the role of Xic 
colocalization, a description of the system is demanded 
that considers the influence of the spatial configuration 
on the interaction of the Xic with the BF. 

The molecular nature of the blocking factor is itself 
still unknown: it must be a unique entity to perform 
its function, though, several considerations (e.g., degra- 
dation, over-production problems) exclude the possibil- 
ity that it is a single protein or RNA molecule. On the 
other hand, if a diffusible controlling factor is produced 
in several copies that can statistically reach the target, 
asymmetric binding to two equivalent chromosomes must 
be explained. While the BF can be envisaged as a unique 
complex formed by autosomally derived molecules, why 
only one is formed is not understood. 

We introduce a Statistical Mechanics schematic model 
of the diffusible "controlling factors" theory of X inacti- 
vation and we explain how a supermolecular complex can 
be self-assembled and why only one is formed out of many 
molecules, resulting in a spontaneous symmetry breaking 
(SB) in the binding to two identical chromosomal targets. 
We use, then, our new insights on the "blocking factor" 
to derive biologically relevant implications and depict a 
comprehensive scenario of experimental evidences that 
highlights the implications of Xic colocalization with re- 
spect to the kinetics of XCI. 

Our model - In our model, for simplicity, we include 
just the essential components of the process we are inter- 
ested in (see FigJT]): the two relevant proximal portions 
of, say, the Xic, where the diffusible controlling factors 
are assumed to bind, and a portion of space surround- 
ing them. Such a volume includes an initially random 
distribution of molecular controlling factors originated 
by autosomes. These factors are represented by diffus- 
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ing particles having an affinity for their target regions on 
the two X chromosomes, as well as a reciprocal affinity 
among themselves, as they can form a complex. 

In our schematic description, the two Xic segments are 
parallel, at a given distance L in some units do (of the 
order of the unknown molecular factors size) , in a volume 
(a cubic lattice with spacing do) of linear sizes L x — 
2L, L y — L and L z — L around them (see Fig[T]). The 
diffusing factors randomly move from one to a nearest 
neighbor vertex on the lattice. On each vertex no more 
than one particle can be present at a given time [Toj ] . 

Each particle interacts with its nearest neighbors via 
an effective energy Eq. Below, we mainly discuss the 
case where Eo is of the order of a 'weak' hydrogen bond 
energy, say 6 kJ/mole, which at room temperature cor- 
responds to Eq = 2AkT [IH (the "random walk" model 
is recovered if Eq = 0). The probability of a particle 
to move to a neighboring empty site is proportional to 
the Arrhenius factor r$ exp(—AE/kT), where AE is the 
energy change in the move, k the Boltzmann constant 
and T the temperature [HI, [II] ■ The prefactor ro is the 
reaction kinetic rate (setting the time scale here), de- 
pending on the nature of the molecular factors and of 
the surrounding viscous fluid (for example, ro = 30sec _1 
is a typical value of biochemical kinetics). Finally since 
the Xic chromosome segments have an affinity for parti- 
cles, each lattice site belonging to the chromosomes has a 
binding energy Ex (equal for the two X's) with particles; 
for simplicity we take Ex — 2AkT too. 

The idea we illustrate below is that the molecular 
factors interaction, Eq 7 induces cluster formation (see 
FigO}: when a freely diffusing particle collides with a 
cluster of other particles, it tends to "stick" to them, 
which produce cluster growth. We show that if Eq is 
above a given threshold, E* (of the order of 'weak' hy- 
drogen bonds), a phase transition occurs and the many 
clusters eventually coalesce in a single major "complex" . 
Interestingly, the time, r, to form the complex rapidly 
grows with the X segments distance, L, explaining the 
important role of X colocalization. 

Computer Simulations - We studied by Monte Carlo 
simulations [l3| the dynamics and the final state attained 
by the system. The size of our lattice is L = l6do (we 
checked our results for L as large as 128do), with peri- 
odic boundary conditions. Averages are over at least 256 
runs and time is given in units of Monte Carlo lattice 
sweeps [l3j]. The fraction of particles per lattice site in 
the examples below (see Fig[T]) is c = 25 • 10~ 3 . 

Pictures of the system state at given time slices dur- 
ing a typical evolution are shown in Fig[TJ which com- 
pares the simple "random walk" model (Eq = 0) with 
the present model (E /kT = 2.4). The difference be- 
tween the two is apparent: in the "random walk" case, 
particles diffuse without forming any structure except for 
some binding to the 'chromosomes'. When Eq = 2AkT, 
clusters of particles form, which end up in a single big 



cluster covering only one of the 'chromosomes'. This phe- 
nomenon, similar to nucleation where DNA acts as a seed 
[3] , illustrates how the formation of a single "complex" 
and the spontaneous breaking of the binding symmetry 
between the two X chromosomes can occur. Notice that 
if the chromosome affinity tends to zero, Ex ~ * 0, a sin- 
gle cluster of particles is still eventually formed, but its 
binding to the X's is unlikely. 
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FIG. 1: We show the evolution of our particle system, around 
two equally binding "chromosomes", when the effective par- 
ticle interaction energy is Eo = 2AkT (left) and Eo — 
(right), starting from the same initial random configuration 
at t = 0. When Eo = 0, a "random walk" diffusion is found. 
The evolution is drastically different for Eq — 2AkT, where 
droplets of particles are formed, ending up in a single clus- 
ter covering just one of the two equivalent chromosomes and, 
thus, breaking their binding symmetry. 

FigH] shows the evolution, during the same kind of run 
of Fig[TJ of the average concentration around the chro- 
mosome on the left, pi, and on the right, p r (pi — Ni/Vi 
where Ni is the number of particles in a cylinder, of 
radius R = 2.5do and volume Vi = irR 2 L z , centered 
around the left chromosome; analogously p r — N r /V r ). 
In the "random walk" case, Eq = 0, the naive expecta- 
tion from the symmetry between the two chromosomes 
that pi(t) = p r (t) is indeed verified at all times, t; pi(t) 
and p r {t) slightly grow up to a value comparable to the 
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initial random one. In the Eo/kT = 2.4 case, pi and p r 
start from the same initial value, c, but at some point 
one of the two has a crisis as particles are all taken in the 
region of the other X whose local concentration gets one 
order of magnitude larger than at the beginning. 

The assembling of a single factor is attained when a 
balance is achieved between the entropy reduction and 
energy gain in the process: for a given concentration of 
the particles, c, only when the interaction energy, Eq, is 
above the phase transition line value E*(c) (broken line 
in the phase diagram of the right inset in FigJ2]) , a single 
major complex is formed and, thus, the original binding 
symmetry between the two chromosomes is broken. 

0.25 
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FIG. 2: The average concentration around the chromosome 
on the left, pi, and on the right, p r , of Fig[T]are shown as a 
function of time in a logarithmic scale. Circles refer to the 
case Eo = 2.4&T and squares to Eo = (filled symbols for 
pi, empty symbols for p r ). If Eo = 0, the symmetry between 
the two chromosomes is preserved during the evolution and 
pi(t) — p r (t). When Eo/kT = 2.4, the symmetry is broken: 
after a transient, pi grows one order of magnitude larger than 
at the starting point and p r — * 0. Right inset The system 
phase diagram, derived by simulations, is shown in the inter- 
action energy-concentration plane, (Eo,c), where the broken 
curve fits the phase transition line. Left inset We plot the 
time, t, to assemble the BF as a function of the distance L 
between the two X segments in a box of given size L x = lo, 
L v = l /2 and L z = l /2 (with l = 64d , E x = E = 2AkT 
and c = 25 ■ 10~ 3 ). Specifically, we show the ratio t(L)/tis, 
where Tie is the reference value for L = 16do, the distance 
considered in the rest of the paper. The fit is: r = a + bL 2 . 

Discussion - The SB mechanism we illustrated can 
explain why only one X chromosome per cell is active: 
it is the only one to recruit a sufficient number of con- 
trolling factors. The time, t, to complete the assem- 
bling of the blocking factor complex (BF) can be de- 
fined from the dynamics of the system "order parame- 
ter" , m(t) = \pi(t) — p r (t)\/(pi + Pr), which at long times 
is approximately exponential: m(t) oc exp(— t/r). Alike 
Brownian processes, r increases as the square of the X 
segment distance, L: r ~ L 2 (see left inset of FigJ2J). 
This suggests that only when the X's colocalize the BF 



can be assembled in a time short enough to be useful on 
biological time scales. 

Differences in the affinities of the X, Ex, induced for 
instance by deletions of binding sites on one of them, re- 
sult in a decisive breaking of the symmetry in particle 
binding: the X with less affinity remaining "naked" and, 
thus, unprotected from inactivation. This explains biased 
XCI in embryonic tissues resulting from allelic differences 
in Xic sequences, (e.g., in the Xce locus [15| or other re- 
gions [H, @, [I])- XCI in female cells and lack of XCI in 
male cells could derive from a similar mechanism. With 
respect to XCI in polyploid cells, in our simulations the 
higher is c the larger the probability to have, at interme- 
diate stages, two clusters, bound to two X, large enough 
to act as BF's. This could describe the stochastic nature 
of the 'X chromosome/autosome ratio effect' [H, [H, [jjj] . 

The SB model also rationalizes recent important dele- 
tion experiments across the Xic reg ion, known to affect 
choice and counting (see d, 0, \WL 17, 18, 0, [2(| and 
Ref.s therein). We summarize here, in particular, the 
phenotype of three deletions, which were instrumental in 
defining the role of the region 3' to Xist in counting and 
choice, namely A65kb[3, Tsi^ CpG % and Xite AL \v^. 

The A65kb deletion removes 65kb of DNA in the 
Xic region relevant to the chromosome activation 
A65kb causes non-random inactivation of the deleted X 
in heterozygous XX cells [l6j], and X inactivation in XY 
cells [18J. The explanation from BF models is that the 
A65kb deletion removes the binding sites for the block- 
ing factor and the complex cannot bind the X any more. 
Interestingly, the X chromosome bearing the deletion is 
not active, not even in male cells. 

The behavior of male cells is, however, drastically dif- 
ferent in the case of shorter deletions. The analysis of 
two smaller non-overlapping deletions within the above 
mentioned A65kb sequence, namely the Tsix ACpG dele- 
tion, removing the Tsix promoter, and the Xite AL mu- 
tation, removing Xite, added further important informa- 
tion. In heterozygous XX cells, the Ts«:r AC ' pG deletion 
causes non-random inactivation of the deleted X, whereas 
in XY cells the Tsix ACpG deleted X remains active^. 
The Xite AL phenotype is analogous 19]. Finally, in ho- 
mozygous Tsix ACpG XX mutants the choice of the active 
X is still random 21 1, but, importantly, in a fraction of 
cells both X's are inactivated ("chaotic counting" 

As usual single BF models cannot explain these re- 
sults [8], the simulations with the SB version we pro- 
pose (summarized in FigJ3]) allow a fresh look at the 
Tsii ACpG and Xite AL data. In our model, the block- 
ing factor is a cluster of transacting factors which can 
bind several sites on a chromosome at the same time. 
The Tsix ACpG deletion reduces the total blocking fac- 
tor/chromosome affinity. In our model, the difference 
in the affinity, Ex , of the wild type and of the deleted 
chromosome explains, as described before, why choice is 
skewed in the heterozygous XX cells (see Fig|3J\,B). At 
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variance with the results from the longer A65kb deletion, 
however, in the case of the smaller Tske ac ' pG and Xite AL 
deletions, the mutated X remains active in XY cells. This 
is easily understood within the SB model: if binding sites 
are found in both the regions deleted by Tsix ACpG and 
Xite AL , then each mutation will reduce the affinity of 
the chromosome for the blocking factor, though neither 
deletion will fully erase the overall affinity. Thus, in 
XY cells the blocking factor can still bind the deleted 
X chromosome (see Fig[3]C,D), since there is no compet- 
ing wild type X. Random choice in homozygous XX is 
explained as before (see Figj3fi!). "Chaotic counting" [8] 
in homozygous deleted mutants derives from an analo- 
gous mechanism: deletions significantly reduces the total 
X-chromosome/particles affinity (see FigUO^G) and, in a 
fraction of cells, the blocking factor doesn't bind at all. 
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FIG. 3: A pictorial summary of SB model results. Panels 
A,B) consider heterozygous XX cells with either a compara- 
tively long deletion in the Xic region (X iA hsay A65kb[3, 
or a shorter deletion (X sA ), say Tsix ACpG Q|. The mutated 
X, having a reduced overall affinity for the blocking factor 
(BF), looses on average the competition for it (skewed inac- 
tivation). C,D) In XY cells, the effects of X sA and X LA can 
be quite different: X LA being unable to bind the BF. E,F) 
In homozygous XX cells X s A , with a reduced affinity, suc- 
ceeds in binding the BF only in a fraction of cases ( "chaotic 
counting 

"01), whereas G) X iA is unable to bind BF. H) In 
XY cells, transgenic autosomes with long enough Xic inser- 
tions can bind the BF and the X is inactivated in a fraction 
of cases. 



Transgene insertions into autosomes have also been an- 
alyzed [la, |22|, |23| • When long Xic transgenes are intro- 
duced, in multiple copies 15j, into autosomes of male ES 



cells, inactivation of the single X occurs in a fraction of 
the cells 22|, |23[ . In our view the mutated autosomes can 
bind the BF and compete with the X for it, leaving the 
real X chromosome prone to inactivation (see Fig[3jr[) . 

The simple version of the SB model here discussed con- 
siders only a single kind of BF complex, though the model 
could easily accommodate more than one (in a Potts-like 
variant), as recently proposed in [8j. It does not either 
imply that only one kind of soluble factors is involved. 

Summarizing, the SB regulatory mechanism we pro- 
pose describes the self-assembling of the blocking factor 
from diffusible DNA binding molecules and explains why 
only one is formed, i.e., the binding symmetry of the 
two equivalent X's is broken. The emerging picture of 
its properties helps in delineating a scenario of biologi- 
cal implications reconciling within a single framework the 
existing experimental evidences (e.g., X colocalization). 
In our model "counting" and "choice" are no longer dis- 
tinct phenomena: they are regulated by the SB stochastic 
mechanism where time is an important parameter. More 
generally, the simplicity and robustness of the SB mech- 
anism, illustrated here for XCI, suggest it can be widely 
used in random monoallelic expression processes [24| . 
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